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Abstract: Due to the effort of a number of authors, the value c u of the absolute constant 
factor in the uniform Berry-Esseen (BE) bound for sums of independent random variables 
has been gradually reduced to 0.4748 in the iid case and 0.5600 in the general case; both these 
values were recently obtained by Shevtsova. On the other hand, Esseen had shown that c u 
cannot be less than 0.4097. Thus, the gap factor between the best known upper and lower 
bounds on (the least possible value of) c u is now rather close to 1. 

The situation is quite different for the absolute constant factor c nu in the corresponding 
nonuniform BE bound. Namely, the best correctly established upper bound on c nu in the iid 
case is about 30 times the corresponding best known lower bound, and this gap factor is greater 
than 100 in the general case. In the present paper, improvements to the prevailing method 
(going back to S. Nagaev) of obtaining nonuniform BE bounds are suggested. Moreover, a 
new method is presented, of a rather purely Fourier kind, based on a family of smoothing 
inequalities, which work better in the tail zones. As an illustration, a quick proof of Nagaev's 
nonuniform BE bound is given. Some further refinements in the application of the method are 
shown as well. 
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1. Uniform and nonuniform Berry Esseen (BE) bounds 

Suppose that X±, . . . ,X n are independent zero-mean r.v.'s, with 



S := Xi + • • • + X n , A := Yj E I Ai| 3 < oo, and B := JJ] E |Xj| 2 > 0. 

Consider 

A(z) : = | P(5 > Bz) - P(Z > z)\ and r L := A/B 3 , 

where Z ~ N(0, 1) and z ^ 0; of course, ri is the so-called Lyapunov ratio. Note that, in the "iid" 
case (when the Xj's are iid), will be on the order of \jyfn. 
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In such an iid case, let us also assume that EXf = 1. 

Uniform and nonuniform BE bounds are upper bounds on A(z) of the forms 

c u r L and c nu 1—^-5- , (1.1) 
1 + z- 5 

respectively, for some absolute positive real constants c u and c nu and for all z $s 0. 

Apparently the best currently known upper bound on c u (in the iid case) is due to Shevtsova 
[39] and is given by the inequality 

c u < 0.4748. (1.2) 
On the other hand, Esseen's example [8] with iid AYs, n — > 00, z appropriately close to 0, and 

P(Al = 1 - PEss) = PEss = 1 " P(Ax = -pEss) (1.3) 

with pess := 2 - VlO/2 = 0.4188... showed that c u cannot be less than ^77= = 0.4097 . . .; a similar 
lower bound on the BE constant for intervals was recently shown by Dinev and Mattner [6] to be 

= 0.7978 .. ., which is almost twice as large as 0.4097 

Thus, the optimal value of c u is already known to be within the rather small interval from 0.4097 
to 0.4748 in the iid case (in the general case the best known upper bound on c u appears to be 
0.5600, due to Shevtsova [40]; a slightly worse upper bound, 0.5606, is due to Tyurin [44]). 




2. The Bohman Prawitz Vaaler smoothing inequalities 

To a significant extent the mentioned best known uniform BE bounds are based on the smoothing 
result due to Prawitz [38, (la, lb)], which states the following. There exists a nonempty class of 
functions M : R — > C such that 

M(t) =0 if \t\ > 1 (2.1) 
and for any r.v. X, any real T > 0, and any real x, 

<8(M T {-#)Ee iX *){x) < P(A < x) - \ < P(X < x) - \ < <3(M T (#) E e iX *) (x), (2.2) 

where 

M T (#) := Af(#/T), (2.3) 

0(/)(x):=-p.v.J^ e-^/(t) T , (2.4) 

and p.v. stands for "principal value" , so that p. v. ^ := lim e ;o ( \_ e A + J ) ; here and subsequently, 

A|oo 

the symbol # stands for the argument of a function. Of course, the upper and lower bounds in 
(2.2) must take on only real values; this can be provided by the condition that 

Mi := ReM is even and M 2 := ImM is odd. (2.5) 

Note also that the upper and lower bounds in (2.2) easily follow from each other, by changing X 
to —X. 

Inequalities (2.2) may be compared with the corresponding well-known inversion formula 

P(X < x) + \ P(X = x) - \ = <8{Ee tX *){x) (2.6) 
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for all real x; see e.g. [10, (2)]. The multiplier M{jf) of the c.f. E e lX & in (2.2) is the Fourier transform 
of the function M(#) := i ^ e - u *M(t) dt, which may be considered as a smoothing kernel - 
since, in view of (2.1), the spectral decomposition of M does not have components of frequencies 
greater than 1. So, the factors M(+#/T) in the bounds in (2.2) filter out the components of the 
function (5(E e tX &) of frequencies greater than T and thus make the function smoother and flatter, 
especially if T is not large enough. Another way to look at such smoothing is through the Paley- 
Wiener theory, which implies that the Fourier spectrum of a function is contained in the interval 
[— T, T] iff the function is (the restriction to R of) an entire analytic function of exponential type T 
and hence rather slowly varying if T is not large; see e.g. [7, Section 43] . On the other hand, from 
an analytical viewpoint, the presence of the factors M(+#/T) is useful, because one then needs to 
bound the values Ee ttx of the c.f. of X only for t e [—T,T], which is a much easier task unless T 
is too large. 

One particular function M for which (2.2) holds is given by the formula 

M(t) = [(1 - \t\)nt cot TTt+ \t\ - |*|) vr*] I{|t| < 1} (2.7) 

for all t # [38]; here and subsequently, it is tacitly assumed that the functions of interest are 
extended to by continuity. For this particular multiplier M, which was shown in [38] to have 
a certain optimality property, the corresponding smoothing kernel M(#) := ^ $ R e^ lt ^M{t) dt is 
given by the formula 

. , . 27rxsinx (2vr(x + 2vr) - xV (£)) - (1 - cosx) (x 3 V>" O) + 4vr 2 (x + 4tt)) 
Mix) = . o q 

47T 3 X 3 

M(x) 

for x ¥= 0, where ^ is the digamma function, defined by the formula 
iff(z) = T'(z)/T(z); this kernel is (necessarily) asymmetric and alternat- 
ing in sign; also, M(x) dx = M(0) = 1; a part of the graph of this 
kernel M is shown here on the left. 

Earlier, inequalities of the form (2.2) were obtained by Bohman [3] for another class of functions 
M, with apparently not quite as good approximation properties. Another approach to Prawitz's 
results was demonstrated by Vaaler [45] . 




3. Nonuniform BE bounds: Nagaev's result and method 

The classical result by Nagaev [19] is that in the "iid" case 



P(S > z^i) - P(Z > z)\ < c nu E } X ^ (3.1) 

(1 + z 6 )^/n 



for all real z > 0, where c nu is an absolute constant. Bikelis [2] extended this result to the case of 
non-iid Xi's. Nagaev's method involves the following essential components: 

• truncation; 

• Cramer's exponential tilt, together with a uniform BE bound; 

• an exponential bound on large deviation probabilities. 

(v) (v) 
First, truncated versions of Aj, say , are obtained, such that X\ ^ y for some real y > and 

all i (the r.v.'s may, in some variants of this approach including [19], be improper in the sense 
that they may take values that are not real numbers). The truncation is done in order to make 
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the exponential tilt and an exponential inequality possible. The value of the truncation level y is 
chosen (i) to be large enough so that the tails of the truncated sum S (v) := X [ y) + ■ ■ ■ + X n y) be 
close enough to those of S and, on the other hand, (ii) to be small enough so that the exponential 
tilt and the exponential inequality result in not too large a bound. In some variants, including the 
ones in [19, 2], two different truncation levels are used. 

In view of the uniform BE bound, without loss of generality z ^ zq, where zq is an arbitrarily 
chosen positive real number. Two main cases are then considered: 

Case 1: zq < z < c-y / ln(y / n/ E |Xl| 3 ) ("moderate deviations"); 
Case 2: z > zq v c\/h^JvjE\X\\^) ("large deviations"); 

here c is a positive constant. 

In Case 1, of moderate deviations, the exponential tilting is performed, which may be presented 
as follows. Take some real h > and let X\ = x[ h ' v \ . . . , X n = Xn' y ^ be any r.v.'s such that 

for all bounded (or for all nonnegative) Borel-measurable functions g: IR n — > M. Equivalently, one 
may require condition (3.2) only for Borel-measurable indicator functions g; clearly, such r.v.'s Xi 

do exist. It is also clear that the r.v.'s X, are independent. These r.v.'s, the JQ's, may be referred to 

(v) 

as the tilted or, more specifically, /i-tilted versions of the X- s. Clearly, without the truncation, the 
tilted versions of the original r.v.'s X{ may not exist, since Ee hS may be infinite even if E |A^| 3 < oo 
for all i. Using (3.2) with g(xi, . . . , x n ) = e ~ h ( XlJ[ hx ") l{x\ + ■ ■ ■ + x n > x}, it is easy to see that 

rco 

P(SM > x) = Ee hsiv) I du he- hu P(x < S < u) (3.3) 

Jx 

for all real x, where S := X% + • • • + X n . Similarly, one can write 

P(BZ > x) = E e hBZ du he~ hu P(x < BZ + B 2 h ^ u) (3.4) 

Jx 

for all real x, since any /i-tilted version of the r.v. BZ has the distribution N(B 2 h, B 2 ). At that, 
good choices for y and h are of the form ax and r]x/B 2 , for some real parameters a and rj in (0, 1). 

So, to bound | P{S^ > z^n) - P(Z > z)\ (cf. (3.1)), one can demonstrate sufficient closeness 
of the terms Ee hS and P(x < S < u) in (3.3) to the corresponding terms Ee hBZ and P(a; < 
BZ + B 2 h ^ u) in (3.4). For each i, one notices that 

E \ X ^fe hX i V) ^e^E|X,| 3 (3.5) 

and then shows that E e hx i^ is close enough to 1 and, somewhat more precisely, to E e hZ ^ E x t , 
and that the mean and variance of X^ are close enough to h EX 2 and EX 2 , respectively. So, one 
shows that E e hS v) is close to E e hBZ , and the first two moments of are close enough to those 
of BZ + B 2 h. Using now a uniform BE bound as in (1.1) - but for the X^s rather than the X^s, 
one shows that P(x < S < u) is close enough to P(x < BZ + B 2 h < u). 

In Case 2, of large deviations, instead of the exponential tilting and a uniform BE bound, one 
employs an exponential inequality to bound P(S^ > x) and hence P(S^ > x) — P{BZ > x) from 
above; for the lower bound on the latter difference, one simply uses — P{BZ > x). 
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3.1. A historical sketch of the problem of nonuniform BE bounds 
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The constant factors c nu in the mentioned papers [19, 2] were not explicit. All papers known to this 
author with explicit values of c nu followed the scheme of proof given by Nagaev [19], as delineated 
above. 

Apparently the first such explicit value of c nu was greater than 1955, as reported by Paditz [24]. 
In his dissertation [23], a much better value, 114.7, was presented, according to [21]; this author 
has not been able to access [23]. 

Michel [15] showed that in the iid case c nu ^ c u + 8(1 + e), which would be less than 30.2211, 
assuming the mentioned value 0.4748 for c u , obtained in the later paper by Shevtsova [39]. 

Tysiak [43] stated c nu to be no greater than 32.88. However, as was pointed out in [22], the 
expression g3(x) defined in [43, page 42] is missing a factor equal x 8 , and this expression is used 
with x > 3.4172, so that the missing factor is greater than 3.4172 8 > 18593. Even though this factor 
is missing only in one of the 5 summands comprising the final bound in [43], the overall impact of 
this omission could still be very large. 

Mirakhmedov [16] claimed that Michel's constant, c u + 8(1 + e), remains valid even without the 
iid assumption. However, as was pointed out by Wolf in his review of [16] (MR0780094), there 
was a mistake in [16], which used the Lyapunov ratio for the original r.v.'s X{ instead of that for 
the tilted truncated ones, Aj. There appear to be a number of other mistakes and gaps in [16] as 
well. In particular, it appears that the term h 2 B 2 /2 should not be on the right-hand side of [16, 
(10)]. Also, when using [16, (10)] to get [16, (12)], it is unclear how the term 00.1876 from [16, 
(10)] disappears in [16, (12)], given that L n can be arbitrarily small. It is also unclear how one 
could get [16, (11)] from [16, (7) and (8)]. The subsequent letter to the editors [22] claimed that 
one could use the methods of the mentioned papers by Tysiak [43] and Mirakhmedov [16] to show 
that c nu =5 32.1530. However, all the details for this claim in [22] are given in just four lines, and 
the displayed inequality there is incorrect, apparently due to a typo and mistaken editing (as the 
comparison with the manuscript of the letter shows) . 

Later, Paditz [25] claimed that c nu < 31.935. However, the proof in [25] contains a gap too. 
Namely, the case c n ,5 )Qj/ g(x) < K 2 < x 2 is not considered in [25], where c n ^ a p{x) is a certain 
expression defined there. It appears likely that, when this missing case is considered, the value of 
the constant factor will go up. In the iid case, Gavrilenko [9] claimed (instead of the bound of the 
form c nu L/(l + z 3 )) the nonuniform bound 22.7707(L + 1/Vn)/(1 + z 3 ) sc 2 x 22.7707L/(1 + z 3 ). 
However, the proof in [9] has the same gap as the proof in [25]; also, the proof in [9] relies on the 
mentioned work [43]. Again in the iid case, Nefedova and Shevtsova [20] briefly stated that they 
had gone along the lines of the proof in [25] except using a better value for c u (namely, 0.4784, 
obtained in [13]) in place of such a value (namely, 0.7915 [41]) used in [25], to get 25.80 for c nu ; it 
is unclear if they fixed the gap in [25] or even noticed it. 

Finally, once again in the iid case, Nefedova and Shevtsova claimed in [21] that c nu < 18.2. 
However, there appears to be an error there as well. Namely, the first inequality in [21, (14)] is 
equivalent to the reverse of the last inequality on page 75 there, which latter is in turn equivalent to 
the condition x 2 ^ c n (x; 5, a, b, c) in [21, Theorem 1], which is also equivalent to the second display 
on [21, page 75]; the expression c n (x; 5, a, b, c) is defined in the first display on page 70 of [21]. So, 
for any given A, n,fe, c, <5 satisfying all the conditions of [21, Theorem 1], the first inequality in [21, 
(14)] and the last inequality on [21, page 75] can both hold only for one value of a. This wrong 
inequality in [21, (14)] is also used for [21, (16)]. 

This leaves, for now, 114.7 as the best (possibly correctly) established nonuniform BE constant 
factor c nu - in the general, non-iid case. On the other hand, it follows from a result by Chistyakov 
[5, Corollary 1] that c nu is necessarily no less than 1, and this lower bound on c nu is asymptotically 
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exact in a certain sense for z — > go. Apparently, this has been the best known lower bound on c nu . 
However, it is easy to improve this bound slightly and show that necessarily 

c nu > 1.0135; (3.6) 

this can be done by letting X\ have the centered Bernoulli distribution with parameter p = 8/100 
and then letting n = 1 and z \ 1 — p. However, it was shown by Bentkus [1], the best constant 
factor for n = 1 will be 1 if 1 + z 3 in (3.1) is replaced by z 3 ; it is also conjectured in [1] that the 
same constant factor, 1, will be good for all n. 

Thus, in the non-iid case the apparently best known lower bound on c nu is over 100 times smaller 
than the best established upper bound on c nu , and this gap is about 30 times in the iid case. So, 
the situation with c nu is much worse than that with c u . 



3.2. Possible improvements of Nagaev's method 

A crucial component of the mentioned method offered by Nagaev [19] and used in the subsequent 
papers [2, 24, 23, 15, 43, 16, 22, 25, 9, 20, 21] is an exponential inequality. However, the exponential 
bounds used in all of those papers are not the best possible ones. An optimal exponential bound, 
in terms of the first two moments and truncated absolute third moments of the X^s was given 
by Pinelis and Utev [37]. In fact, the paper [37] provided a general method to obtain optimal 
exponential bounds, along with a number of specific applications of the general method. 

However, even the best possible exponential bounds, say for sums of independent r.v.'s, can be 
significantly improved. The reason for this is that the class of exponential moments functions is 
very small (even though analytically very simple to deal with). Using a much richer class of mo- 
ments functions, Pinelis [34] obtained the following result. Let X\, . . . ,X n be independent random 
variables (r.v.'s), with the sum S := X\ + • • • + X n . For any a > and 9 > 0, let T a 2 and Tig 
stand for any independent r.v.'s such that T a 2 has the normal distribution with parameters and 
a 2 , and has the Poisson distribution with parameter 6. Let also IIq := Hq — EILg = Hg — 9. 
Let a, y, and f3 be any positive real numbers such that e := -J^ e (0, 1). Suppose that 2?, E^f < 

a 2 , Yii ^ EX, ^ 0, and X; L < y, for all i. Let % )C r )2/ := r( 1 _ e ) cr 2 + ytl £a 2/ y 2. Then it is 

proved in [34] that 

Ef(S)<Ef( V£ , a>y ) (3.7) 

for all twice continuously differentiable functions / such that / and /" are nondecreasing and 
convex. A corollary of this result is that for all x e M 

P(5 > x) < inf E{ J' a ' y ~f + ^ c 3 ,o P LC (Ve,.,y > x), (3.8) 
te(— oo, x) [X — t)° 

where 03,0 := * 4.46 and the function Rai^ P LC ( ? ? ^ x ) is defined as the least log-concave 
majorant over R of the tail function Ksih* P(j] ^ x) of a r.v. rj. The bounds in (3.7) and (3.8) 
are much better than even the best exponential bounds (expressed in the same terms). 

A trade-off here is that the bounds given in (3.8) are significantly more difficult to deal with, es- 
pecially analytically, than exponential bounds. However, this can be done, as shown in the following 
discussion. In accordance with what was pointed out above, one needs an exponential bound (or a 
better one) only in Case 2, of large deviations, when z ^ zq v cy / ln(y / n/ E |Ad| 3 ), which implies 

E|X!| 3 /V^^ e- z2/c \ (3.9) 
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Also, by (3.8), for any real r < 1 

oo vj 
j=0 h 



where 



x := Bz = z^/n, y = ax, a e (0, 1), Qj := E(Z + + , 

/ 17~ Ely a (Xi) + ) 3 / r \ a 

ai := yl-a^/a, a := -™ , Uj - := a(j - - - AJz, A := -3. (3.10) 



a J cr 

„2 



Assume now that rc ^ 2, where c is as in (3.9). Since z ^ zq, one has 



Qo^C ^l, (3.11) 



where 

C :=e^ 2 E(Z-rz ) 3 + ; (3.12) 

here one uses the fact that e pt E(Z-t)% is decreasing in t ^ provided that f3 ^ 1/2; in fact, this 
decrease is fast, especially when /3 < 1/2. Note also that E(Z - t)\ = (t 2 + 2) ip(t) - t (t 2 + 3) 
for all real t, where (p and $ are the density and tail functions of Z. 

Next, since Eg{f3Z) is nondecreasing in f3 ^ for any convex function g, 

Qj < (ai/zo) 3 E(Z + Ujo)%, where Ujo := a(j — r/a — X)z . 

Using now the identity E(Z + t)+ = t 3 + 3t + E(Z — t)\ for all real t and the decrease of E(Z — i) 3 
in t e R, one has 

00 V F I X I 3 

^Q^e-^C^a)-^, (3.13) 



, 3- ^ J V n 



where 

3 00 \ j 1 \2— 1 

C ^ := fl>i° + 3%o)— + E(Z - u 10 ) 3 + + E(Z - u 2Q )l—e x 



with A as defined in (3.10). The sum in the above expression of C\(a) is easy to evaluate 
explicitly. Also, since the left-hand side of (3.1) can never exceed 1, without loss of generality 

(3.14) 

with a max = l/c nu ; working a bit harder, one may assume (3.14) with a max significantly smaller 
than l/c nu . Next, it appears that for values of the parameters a and r that have a chance to be 
optimal or quasi-optimal, the factor C\{a) will be decreasing in a e [0, a max ]. Therefore and in view 
of (3.11) and (3.13), one will have 

P{s (y) > x )< C ° ( |f 1 r ( ) °3 +) (3.15) 



with Co as in (3.12) and Ci(0+) = (a^o)~ 3 E(Z + (a — t)zq)\. One can improve the above estimates 
by partitioning the interval [0, a max ] into a number of smaller subintervals and then considering 
the corresponding cases depending on which of the subintervals the value of a is in. 
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Thus, it is shown that P(S^ > x) can be appropriately bounded using the better-than-exponential 
bound in (3.8), and at that in a rather natural manner and incurring almost no losses. It should be 
clear that the expression on the right-hand side of inequality (3.15) will become a term in a bigger 
expression that is an upper bound on the left-hand side of (3.1). That latter, bigger expression will 
then have to be (quasi-) minimized with respect to z$, a, rj, t, and the other parameters, subject 
to the necessary restrictions on their values. 

Also, one can use ideas from [17, 18, 35, 36] to improve the estimation of the effect of truncation, 
as compared with the way that was done in the mentioned papers [19, 2, 24, 23, 15, 43, 16, 22, 25, 9, 
20, 21], as well as more "synthetic" ways to bound moments of the tilted distribution - cf. results in 
[30, 29, 33, 32]. In addition, as in [21], one can use the uniform bound 0.3328(E |A"i| 3 + 0.429)/Vn 
from [39], which is smaller than the previously mentioned bound of the classical form c u L = 
c u E iXip/y'n with c u = 0.4748. There are a few other potentially useful modifications. Thus, the 
improvements concern every one of the three major ingredients of Nagaev's method listed beginning 
on page 3. By utilizing the above ideas, one may hope to improve the upper bound on c nu to about 
10 in the iid case and to about 12 in the general case. When and if such an objective is attained, 
the gap between the available upper and lower bounds on c nu will be decreased, at least, about 3 
times in the iid case and about 10 times in the general case. 

However, significant further progress after that seems unlikely within the framework of the 
method of [19]. One of the main obstacles here is the factor e hy as in (3.5). Since good choices 
for y and h turn out to be ax and rjx/B 2 with a and r\ somewhat close to 0.5, this factor will then 
be something like e 2 / 4 , which is large for large enough z. 

Yet, the factor e hy is the best possible one in (3.5) (even assuming that X\ v) = Xi and hence x\ y) 
is zero- mean). Such a large factor is necessary when Xi has a two-point distribution highly skewed 
to the right. On the other hand, certain considerations suggest that the least favorable situation 
in Case 1 of moderate deviations is when n is very large but z is not so, and then the mentioned 
least favorable distribution (for the uniform BE bound) given by (1.3) is only slightly skewed. This 
creates a significant tension in using the exponential tilt. 

One may try to reduce the factor e hy by decreasing a and hence y - but this will increase fast 
the effect of truncation, which is (at least roughly) proportional to 1/a 3 . 

Even if one were able to get rid of the factor e hy altogether, the corresponding uniform BE 
bound on the rate of convergence to the probability P(x < BZ + B 2 h ^ u) in (3.4) would still seem 
relatively too large, since this probability itself is less than P(x < BZ + B 2 h) = P [Z > (1 — r])z) 
and therefore is rather small for what appears to be the least favorable values of z, such as 2.5 
to 3.5 (and values of rj typically not too far from 0.5). In contrast, the mentioned asymptotic 
lower bound by Esseen [8] (recall (1.3)) is attained for z close to 0; furthermore, the corresponding 
asymptotic expression is rather highly peaked near the maximum and is thus much smaller outside 
of a neighborhood of the maximum point. 

Yet another apparently powerful cause of tension is as follows. After the Aj's have been truncated, 
a natural bound on | P(S r ^ > x) — P(BZ > x)\, obtained via either the exponential tilt or a Stein- 
type method, decays in an exponential rather than power fashion; see e.g. the results [4] and [28], 
which imply an upper bound of the form c(A) ^^J- for real A > 0, say in the iid case. The factor 

l/e Xz decays much faster than 1/(1 + z 3 ) when z is large. However, the former factor may be much 
greater than the latter, especially if A is not large and z is not very large. For instance, if A = 1/2 
as in [4], then max 2> o / = 10.8 . . . , attained at z = 5.9719 .... 

In the next section, a new approach to obtaining nonuniform BE bounds is described, based on 
the Fourier method, complemented by extremal problem methods. 
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4. A new way to obtain nonuniform BE bounds 

Take any function h e C 1 such that (the limit) &(h) exists (and is) in M and h(t)/t —* as \t\ —* oo; 
here and in what follows, C k denotes the class of all k times continuously differentiable complex- 
valued functions defined on R. Take any real ieI. Note that &(l)(x) = \ signs - say, by (2.6) 
with X = 0. So, writing &(h) = /i(0)(8(l) + <8[h — /i(0)) and evaluating the p. v. -integral in the 
expression for <5(h-h(0)) by parts, one has 

x&(h)(x) = ^ /i(0) xsignx + i&(Ah)(x) (4.1) 

if x # 0, where the linear operator A is defined by the formula 

( A fr)(t) : = -t - Kt) - m = h (0)-[Kt) + h'(t)(-t)] (4 2) 

for t 7^ 0. In fact, identity (4.1) holds for x = as well, in view of the definitions of <3 and A. By 
induction, for all k e N := {1, 2, . . . } 

(A k h)(t) = -k\t~ k (h(0) - V h^(t)^$^) =(-l) fc C[h {k) (t)-h^(at)]ka k - l da (4.3) 
V ,to ^ ^ Jo 

if he C k and t # 0, and hence (A fc /i)(0) = 0. So, iterating (4.1), one has 

eWW-M-SEf + ^'e^K,) (4 .4) 

for all real x ^ 0, all A; e N, and functions h such that 

heC k , &{h) exists in R, and h^\t)/t -» for all j = 1, . . . fc as \t\ ->■ oo. (4.5) 
More generally, 

e(W „ - + ' £ ^'(0 + ) -^(0-) + (|) (4 . 6) 
2 Z7T f— ' liix)-) \xJ 

for all real x 7^ 0, all e N, and all functions /i such that 

heC(R), he C fc (M\{0}), <S(/i) exists in M, and 

for each j e {1, . . . A;} there exists /i (j) (0±) e R and h^ j \t)/t ^ as |t| 00. (4.7) 

The condition h e C fc (IR\{0}) in (4.7) can be slightly relaxed, to the following: 

h e C fe_1 (R\{0}) and h {k ~ l) is of locally bounded variation on M\{0}, (4.8) 

with <&{A k h){x) then understood as <5(A k h)(x) + (-l) fc 0(/j(* _1 ))(s), where (A k h)(t) := 
—k\ t- k (h(0) - S,to h^{t) ±=f) for t ^ and &(h^)(x) := £ p.v. ^ e -^ ^M . 
Identity (4.4) immediately implies 

Theorem 4.1. Take any k e N and any real T > and x > 0. Lei X be any r.v. with E \X\ k < 00. 
Let / denote the c.f. of X . Let M be as in (2.2), with the additional requirement that M e C k . 
Then 

-i k (5(A k r T ^)(x) sc x k P(X > x) < x k P(X > x) < -i fe ©(A fc r T ,+) (s), (4.9) 

where 

rr,±(#) := M T (+#)/(#). (4.10) 
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Remark. Condition E \X\ k < oo in Theorem 4.1 implies / e C k , so that (4.5) holds with g = tt + ■ 

As was mentioned, the Prawitz smoothing filter M given by (2.7) provides the tightest, in a 
certain sense, upper and lower bounds in (2.2) on the d.f. of X. However, it is not smooth enough 
to be used in Theorem 4.1 in the most interesting in applications case k = 3. Namely, that M is 
not even in C 1 - whereas one needs M e C 3 in Theorem 4.1 for k = 3. 

There are a number of ways to develop such a smooth enough smoothing filter. Some of them 
can be based on Proposition 5.1 in Section 5 of this paper; see e.g. the function M = Mo,2 given 
by formula (5.4). 

The identity (2.6) can be rewritten in the following more general and hence sometimes more 
convenient form. 

Proposition 4.2. Let L be any complex-valued function of bounded variation on R, and let £ be 

its Fourier-Stieltjes transform, so that £(t) = J" e ltx dL(x) for all real t. Assume also that L 
is regularized so that 2L{x) = L{x—) + L(x+) for all x e M and extended to [— oo,oo] so that 
L(+cc) = lim^^+oo L[x). Then 

L{x) - \[L{<x>) - L{-cc)] = <8{l){x) for all real x. (4.11) 

This follows immediately from (2.6), because (i) both sides of (4.11) are linear in L and (ii) any 
regularized function of bounded variation on R is a linear combination (with complex coefficients) 
of regularized distribution functions. 

Suppose that I: M. — > C is a function which may depend on a number of parameters. For brevity, 
let us say that the function I is a quasi-c.f. if it can be represented as a linear combination of k c.f.'s 
with (possibly complex) coefficients such that the length k of the combination and the coefficients 
are bounded uniformly over all possible values of the parameters. 

Clearly, the product of two quasi-c.f. 's is a quasi-c.f. Also, any linear combination of two quasi- 
c.f. 's is a quasi-c.f., provided that the coefficients of the combination are bounded uniformly over 
all possible values of the parameters. Moreover, one has the following simple proposition. 

Proposition 4.3. Take any natural m. Let N denote the c.f of a r.v. Y whose distribution may 
depend on a number of parameters. Suppose that E|y| m is (finite and) bounded uniformly over all 
possible values of the parameters. Then the mth derivative of N is a quasi-c.f. 

Proof of Proposition 4-3. Let us exclude the trivial case when E \ Y\ m = 0. If m is even or E Y™ = 

or EY m = 0, then i\TM(#) := EY ^Z* is a c.f., and = (i m EY m )N(™), so that is a 

quasi-c.f. In the remaining case one has EKJ 11 > and EYZ" > 0, so that one can similarly write 

ATM (#) =i m EY? +y m + {-i) m E Y™ -yL . □ 

A quick proof of Nagaev's nonuniform BE bound (3.1) can be easily obtained based on The- 
orem 4.1. Indeed, let T = CT^/n/P^, where j3% := E|A"i| 3 and ct is a small enough positive 
real constant. Let A < B mean \A\ < CB for some absolute constant C. Let X := S/y/n. 

If T < 1 then 1 < @j=. So, for all real x > 0, by the Markov and Rosenthal inequalities, 



/n 

(1 + x 3 ) P{X > x) < 1 + E|A"| 3 < 1 + ^ < % and similarly (1 + x 3 ) P(Z > x) < whence (3.1) 
follows. 

It remains to consider the case T > 1. Note that then n > (P^/ct) 2 ^ 3 and hence n > 4 provided 
that ct < l/v3. 

In view of the uniform BE bound, Theorem 4.1, and (4.3), in order to prove (3.1) it is enough 
to show that (J5(r'^ / (a#) - r'([ g (a#)) < fo/y/n and (S(r^ / (a#)) < /3 3 /y/n over a e (0,1], where 
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r j)f (#) := Mj(§ )/(#), j = 1,2, the M/s are as in (2.5), M is (say) as in (5.4), / is the c.f. of 
X := S/y/n, and <?(#) := e~# 2//2 (so that g may be considered as a special case of /). One has 

By (5.4) and (5.6), Mi is the c.f. of a distribution with a finite 4th moment, whereas M 2 = 
kM[. Hence, by Proposition 4.3, is a quasi-c.f. for each pair (j,q) £ {1,2} x {0,1,2,3}, and 

then so is M^ q \^). Similarly, / is the c.f. of the r.v. X with E|X| 3 < 1 + -% < 1, by the 

Rosenthal inequality and the case condition T > 1. So, again by Proposition 4.3, f^^ is a quasi- 
c.f. for each q e {0,1,2,3}. Thus, \?f) f( 3 -i)( a #) is a quasi-c.f. and, by Proposition 4.2, 

6(Mj 9) (^)/( 3 - g )(a#)) < 1, for each (j,q) e {1,2} x {0,1,2,3}. Therefore and because T > 1, 

®((g)^ M i 9) (^)/ M («#)) * ^ < ^ ^ for each & ?) e I 1 - 2 ) >< {W}; (4.13) 

note that g = is not included here. 

It remain to show that <5 la (f m - g'") < ^ and <£>2a{f") < where 

<M>*)0»0 := ©(M i (^)(/ i )(a#))(x). (4.14) 

For j e {0,1,2,3}, introduce f[ j) (t) := (^) j /i(t) and := f^(t/^n), where /1 denotes 

(7) (7) 

the c.f. of Ai. Similarly, starting with 51 := g in place of fx, define 5^ , and then let : = 
/in ~~ ffin ano ^ ^lrl := I fin I v |#ln|> om ^ superscripts (°) and M. Note that / = /f n and hence 
VV'" = /si + /32 + /33, where / 31 := (n - l)(n - 2)/ 1 "- 3 (/«) 3 , / 32 := 3(n - 1) f£ 2 f$ f$ , and 
/33 := /"nTVin ? do similarly with 5 and g 1 in place of / and fx. By Proposition 4.3, Mj(^)f 33 /(3 3 
is a quasi-c.f. and hence, by Proposition 4.2, 0j a (/ 33 ) < @3, for j e {1, 2}. 

So, it suffices to show that &xa(f3k — 93k) < P3 and 2 a,(/ 3 fc) < P3 for k £ {1, 2}. This can be done 
in a straightforward manner using the following estimates for j e {0, 1, 2, 3} and \t\ < T: Mi < 1, 

M 2 (|) < I < |t|/VV»> M*) n_j < e- rf2 (where c is a positive real number depending only on the 
choice of c r ), < |t|/v^, fcg(t) < 1, |dg(*)l < /3 3 (|t|/V^) 3 ^, and hence f£ j (t) - g£ j (t) < 

|i| 3 e _c * 2 /3 3 /\/n; cf. e.g. [26, Ch. V, Lemma 1]. For instance, I/31 — g 3 i\ < n 2 (D 3 xi +-D312), where 
%W := (l/iT 3 -^- 3 |(^) 3 )(*) £ We-* 2 %{^f and D 312 (*) := (/»5T 3 (42) Vi2l)(*) £ 

e_rf2 (^) 2 ^(H) 2 , so that <S la (/ 31 - 531) < Coo^ + t A )e- ct2 ?3 f < ft- □ 
Of course, the above argument is rather crude and yet it demonstrates that the method based on 
the smoothing inequalities (4.9) is quite effective. It also strongly suggests that this method can be 
used further, in order to obtain an explicit and appropriately small upper bound on the constant 
factor c nu . 

Let us now discuss some of the refinements that could be used within the general framework of 
the above quick proof of (3.1). 

There, in particular, we needed to bound 

L(H):= (\<8(H(a#))(x)-<8(H(#))]3a 2 da, (4.15) 
Jo 
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where H is of the form Mi(| )(f 3k - g 3k ) or M 2 (f )/ 3fc for e {1,2} - recall (4.9), (4.10), and (4.3). 
Tacitly, that bounding was then done using the trivial inequalities 

1 f 00 dt 

\L(H)\ ^2 sup \<3(H(a#))(x)\ ^2— |#(i)|^, (4.16) 

oe(0,l] Z7T Joo l r l 

where in turn we used the definition (2.4) of <3 and the trivial identity \e~ ltx \ = 1 for real t and 
x; the integral in (4.16) exists even in the Lebesgue sense, since \f 3 k{f) — 93k(t)\ = 0(\t\) and 
\M 2 (t)\ = 0(\t\). 

In fact, the factor 2 in the last bound in (4.16) on \L(H)\ can be removed, so that one have 

1 f 00 dt 

\L(H)\^-^ \H(t)\^. (4.17) 

Indeed, first of all note here that the factor a can be easily moved, in a way, from the argument 
of the general and hard to control function H into that of the much simpler and more specific 
exponential function, using the simple identity 

0(ff(a#))(*) = <5 (#(#))(g), (4.18) 

which implies that 

i f 00 dt 
L(H) = — I(tx)H(t) — , where 
2ir t 

I(u) := f {e~ iu/a - e~ iu ) 3a 2 da. 
Jo 

Now (4.17) follows immediately from 

Proposition 4.4. The expression g{u) := \I(u)\ 2 is even inueM. and (strictly) increases from to 
1 as \u\ increases from to oo ; in particular, it follows that \I(u)\ e [0, 1) for all real u. Moreover, the 
function g has the following generalized concavity property: —u 3 (u~~ 5 g'(u)) is completely monotone 
in u > (in Bernstein's sense - see e.g. [27, Chapter 2]); in particular, g(v l l & ) is concave in v > 0. 

Thus, the conclusion in Proposition 4.4 that |/(u)| £ [0, 1) for all real u can be seen as a rather 
sophisticated replacement for the trivial identity |e~ ite | = 1 for real t and x, which latter was used 
to obtain the rightmost bound in (4.16). 

Moreover, one can easily obtain (and then use in (4.19)) an upper bound on |I(«)| which is 
significantly less than 1 for small enough values of \u\. This can be done by closely bounding the 
values of \I(u)\ for a finite number of values of u and then using the monotonicity property of |/| 
provided by Proposition 4.4. 

Graphs of He I, ImJ, and \I\ over the interval [— 6ir, 6ir] are shown in Figure 1. It seems plausible 
that g{u) is concave in u > 0; however, that probably would be hard to prove. 

Proof of Proposition 4-4- Note that I(—u) = I{u) for all real u. So, the function g = II is indeed 
even. 

Take now any real u > 0. Integrating by parts and then changing the integration variable, one 
has 

f 1 f 00 dz 

iu- 3 I{u) = u- 2 e- iu/a ada = £{u) :=£ 3 {u), where £/(«):= e~ iz — (4.21) 

JO Ju z ° 



(4.19) 
(4.20) 
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Fig 1: Graphs of Re/, Im/, and |/| 



for j > 0. So, g{u) = u £(u)£(u), 



gi {u) := \ u- b g'{u) = 3£(u)£(u) - u~ 2 Re (e iu £(u)), 
\ - Re ((4 + iu)e iM £(u)) 



« 3 5i(m) 



1 



Re((4 + iu) f 

Ju 

f 

Jo 



Re (4 + iu 



00 rl ■? 

i(z-u) 

Z 3 

di> 



rCO rCO rCO 

J e~" s sds-Re((4 + iu) J e^dvj i e -^+ u ) s s 2 ds) . 



Noting now that \™ e~ iv e~ vs dv = -L and Re - 4s±u 

3 2 

iOi(a) := and io 2 (s) := write 



s ^j I Y for all real s > 0, and introducing 



rCO rCO 

2uVi(«)= e" us u;i(s)ds+ e - us w 2 (s)uds 
Jo Jo 

poo pc 

e- us Wl (s)ds + 
Jo Jo 



" ns u;Us)ds = -2 1 



f 

Jo 



s 5 ds 



which verifies the last sentence of the statement of Proposition 4.4; the second equality in the above 
display was obtained by taking the integral ^° e _MS u; 2 (s) u ds by parts. Moreover, it follows that 
gi(u) is decreasing in u > 0. At that, gi(oo— ) = 0, since £(oo— ) = 0. So, on (0, oo) one has the 
following: g\ > and hence g' > 0, and therefore g is increasing. Since g is even and obviously 
continuous, it follows that indeed g(u) increases in |it|. Clearly, g(0) = |/(0)| 2 = 0. It remains only to 
show that g(co—) = 1. Toward that end, integrate by parts to obtain the recursive relation £j(v) 
—ie~ lu u~ 3 + ij£j+i(u) for all j > 0. In particular, it follows that £3(11 
and £4 (it) < u~ 4 



ie -iu u -3 + 2,i£ A {u) 



-iu 3 (e m + o(l)) and g(u 



£ 5 (u)\ < u~ 4 + j§ < u~ 4 . Thus, £{u) = £ 3 (u) = -ie- lu U -' s + 0{ U - 4 ) 
u 6 \£(u)\ 2 — > 1 as u — > 00. 



□ 



One will also could use a better upper bound on |/(i)| for a given real value of t, where / is the 
c.f. of a r.v. X, say with EX = 0, EX 2 = 1, and a given value of p := E |X| 3 . Since 



\f(t)\ = V E 2 cos tX + E 2 sin = sup (cos# EcostX + sin0 EsiniX) = sup Ecos(tX 

6»e[0,27r] 9e[0,2n] 



(4.22) 
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the best upper bound on \f(t)\ under the given conditions is 

S[t,p) :=sup{|/(t)|: EX = 0,EX 2 = 1,E|X| 3 = p) (4.23) 
= sup{Ecos(iX -&): EX = 0, EX 2 = 1,E|X| 3 = p,cardsuppX < 4,9 e [0,2tt]} 

r 4 4 4 4 4 

= sup] J]pjCos(tXj -9): YiPj = IjEPj^j = >TiPj x2 i = l iY,Pj\ x j\ 3 = P, 
1 l l l 11 

p 1 ,...,P4>0, [0,2tt]}; 

cardsupp denotes the cardinality of the support of (the distribution of) X; for the second equality 
here, one can use the known results by Hoeffding [11] or Karr [12] or, somewhat more conveniently, 
Winkler [46] or Pinelis [31, Propositions 5 and 6(v)]. Thus the optimization problem reduces to one 
in 9 variables: pi, . . . ,p^, x\, . . . , X4, 9; in fact, one can easily solve the linear (or, more precisely, 

affine) restrictions YliPj = !>YliPj x j = 0>XliPj x f = 1> Xli p/l^jl 3 = P f° r Pij • • • >P4i an d then 
only 5 variables will remain: x±, . . . , X4, 9, with the additional restrictions on x\, . . . , 2:4 to provide 
for the conditions Pi, ■ ■ ■ ,P4 ^ 0. For any given pair of values of (t,p), it will not be overly hard 
to find a close upper bound on the supremum S(t,p). A difficulty here is that one has to deal 
with two parameters, t and p, and obtain a close majorant of S(t, p) with, at least, discoverable 
and tractable patterns of monotonicity/convexity in t and p, if not with a more or less explicit 
expression. Apparently the main difficulty in dealing with S{t, p) will be that the target function 
cos(£# — 9) oscillates, whereas the function (# — w)\ in [34, Lemma 3.4] is monotonic. 

Similar methods can be used to find a good upper bound on \ f(t) — g(t)\, where g = e~* 2 / 2 , the 
c.f. of the standard normal r.v. Z; in particular, one can start here by writing 

\f(t)-g(t)\= sup (Ecos(tX-0) - Ecos(tZ-9)) = sup (Ecos(tX - 9) - g(t) cos 9) 

0E[O,27r] 6»e[0,27r] 

in place of (4.22). At this point, one also has an option to use Stein's method to bound Ecos(tA — 
9) - Ecos(tZ-9). 



5. Constructions of the smoothing filter M 

The following proposition was somewhat implicit in the paper [3] by Bohman. 

Proposition 5.1. Letp be any symmetric probability density function (p.d.f.) such that the function 
H^pijf) is integrable on M.. Take any real 

1 . . 

\ R \x\p{x)dx 

Letp stand, as usual, for the Fourier transform of p [so that = \ R e tx ^p{x) dx) , and let then 
p' denote the derivative of p [which exists, since ^ R \x\p[x)dx < 00). Then the function 

M := p + inp' (5.2) 

is such that inequalities (2.2) hold for all r.v. X, all real T > 0, and all real x. 

Because of the symmetry of p, in the conditions of Proposition 5.1 the function p = Re M is even 
and np' = ImM is odd, so that conditions (2.5) hold. In order to satisfy the conditions (2.1) and 
M e C 3 as well, one may choose the symmetric p.d.f. po,2 defined by the formula 

32vr 3 1 - cosx 

PoAx) := -IT xHx *-^f (5 - 3) 
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for real x f {— 2tt, 0, 2tt} and then let M be as in (5.2) with p = po,2'- 

M = M ,2 := poi + iKpoj. ', (5.4) 

with any 

k^«o,2 := r | | 1 , > , =0.3418.... (5.5) 
Then M 0i2 e C 3 , since M'" = p"' + inp"" and 

x 4 po^(x) dx < oo. (5-6) 



j 

Jit 



Moreover, it is clear that po,2 is the restriction to M of an entire analytic function of exponential 
type 1; so, by the Paley-Wiener theory (see e.g. [7, Section 43]), the condition (2.1) holds as well, 
with Mo 5 2 in place of M. In fact, 

„ , , / \ , , (2 + cos27rt . . ., sin27r|t| \ _.. . „ . , . . 

ReM , 2 (t) =Po,2(*) = ( 3 (1-1*1)+ 27r ) H\t\ < 1} and (5.7) 

ImM , 2 (i) = «Po5'(t) = -k^^ [2vr(l - \t\) sin 2n\t\ + 4 sin 2 (vrt)] I{|i| < 1} (5.8) 
for all real t. Graphs of po,2, PeMo,2, and ImMo,2 with tz = kq,2 are shown in Fig. 2. 



p , 2 (x) Re M _ 2 (f) Im M o , 2 (0 




Fig 2: Graphs of po,2, ReMo^, and ImMo.2 with k = kq.2- 



More generally, in order that a function M as in (5.2) satisfy the conditions (2.1) and M e C 3 , 
it is enough that p be smooth enough and such that (2.1) holds with p in place of M. Therefore, 
the following well-known characterization is useful. 

Proposition 5.2. (See e.g. [14, Theorem 4-2.4]-) A function f : R —* C is the c.f. of an absolutely 
continuous distribution on R i/ and on/y if /(0) = 1 and / = g *g~ for some (possibly complex- 
valued) function g e L 2 (M). Here and in the sequel, as usual, the symbol * stands for the convolution, 
the bar denotes the complex conjugation, g~(if) := <?(—#), and g~ := (g)~ = g- . 

Indeed, take any smooth enough nonzero function g: M — > C such that g(t) = for all real 
t $ [a,b], where a and b are any real numbers such that a < b. Then, by Proposition 5.2, the 
function / := g * g~/\\g\\2 is the c.f. of an absolutely continuous distribution on R, / is smooth 
enough, and f(t) = for all real t $ [-T, T], where T := b — a. At that, if g is real-valued, then / is 
even. To spell-out the "smooth enough" condition and conclusion here, one can easily check that, 
if g 6 and h e C k for some j and k in N and (say) \g(t) \ + \ h(t)\ = for some real T > and all 
real t $ [0,T], then g * h e C j+k , with {g * h)^+^ = gV) * h^ k \ 

One can use Proposition 5.1 and Proposition 5.2 to optimize properties of the filter M - say by 
taking g to be an arbitrary nonzero real- valued spline of a high enough order and/or with a large 
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enough subintervals of the interval [0, T], extending g to K by letting g(t) := for all real t $ [0, T], 
letting then / := g * ^ / II <? II 2 ' defining M as in (5.2), and finally (quasi-)optimizing with respect to 
the parameters of the spline. 

While the construction described in Proposition 5.1 is comparatively simple, it appears somewhat 
too rigid and wasteful. Indeed, in order that the imaginary part M2 = up' of the function M in (5.2) 
be thrice differentiable (as needed or almost needed in the quick proof beginning on page 10) , the 
real part Mi = p of M must be four times differentiable; equivalently (cf. (5.6)) , the density p must 
have light enough tails so that $ R £ 4 p(x) dx < go. Together with the filtering condition (2.1), the 
condition of extra smoothness of M\ = j5/extra lightness of the tails of p may result in a smoothing 
filter M which is not as good as it can be, thus compromising the quality of the approximation by 
the upper and lower bounds in (4.9). 

A more flexible and potentially better construction of the smoothing filter M can be given as 
follows. As in Proposition 5.1, let us start with an arbitrary symmetric p.d.f. p, whose Fourier 
transform p is intended to be Mi = ReM. Accordingly, let us assume right away that 

p{t) = if |i| > 1; (5.9) 

cf. (2.1). Note that the smoothing filter as in (5.2) is the Fourier transform of the function 

x h-» p(x)(l — kx), (5.10) 

which differs relatively much from the "original" p.d.f. x >-* p{x) when \x\ is large. To address this 
concern, let us replace the "large" factor x in (5.10) by a "tempered" and, essentially, more general 
factor G{x) such that G: M —* M is a strictly increasing odd function of bounded variation, whose 
Fourier-Stieltjes transform dG(#) = J R e iX * dG(x) satisfies the condition 

dG(t) = if \t\ > 7, (5.11) 

for some real 7 > 0. The no-high-frequency-component condition (5.11) implies, by the mentioned 
Paley-Wiener theory, that the function G is the restriction to R of an entire analytic function 
of exponential type 7 and, in particular, is infinitely many times differentiable. Without loss of 
generality, assume that the function \ + G is a d.f. 

As mentioned above, instead of the "harsh" tilting (5.10) of the p.d.f. p, we consider the "tem- 
pered" tilting: 

x p(x) := p(x) (l — kG(x)), (5-12) 

for any real 

k> k - 1 (5 13) 

2 ^ Q p(x)G(x) dx 

Note that ^ G(x) < \ = G(oo-) for all real x > 0; also, as discussed previously, the condition (5.9) 
implies that p is the restriction to M. of an entire analytic function, and so, p > almost everywhere 
on M.. Therefore and by the symmetry of p, one has < ^ p(x)G(x) dx < | ^ p{x) dx = \ and 
hence > 2 and k > 2. It follows that there exists a unique root x K e (0,oo) of the equation 

1 - kG(x k ) = 0. (5.14) 

Hence, p > on the interval (—go, x K ] and p < on [x K , 00), so that the function 

r# 

F(#) := p(y)dy (5.15) 

J— 00 
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is nondecreasing on (— co,x K ] and nonincreasing on [x K ,cc). At that, F(cc— ) = 1, since p is an 

even p.d.f. and the bounded function G is odd; also, clearly F(— 00+) = 0. Moreover, F(Q) = 
\ ~ K f-^v{x)G{x)&x = \ + k\™ p(x)G(x) dx = \ + ^ 5= 1. It follows that 

I{y > 0} sS for all real y. (5.16) 

Let now X be any r.v. and let / by its c.f.: 

/(#):= Ee* x #. (5.17) 
Then P(X < x) = E I{x - X ^ 0} < E F(x - X), by (5.16); that is, 

P(X^x) s; f F(x-y)P(Xe dy), (5.18) 
Jr 

for all real x. Define now M as the Fourier transform of p, so that 

M = p, Mi = ReM = p, and M 2 = ImM = kpG, (5.19) 

by (5.12). Note that the Fourier-Stieltjes transform of the function J R -F(# — y) P{X e dy) is 
the Fourier transform of — y) P{X e dy), which in turn is pf = Mf. Then, in view of 

Proposition 4.2, (5.18) means that the last inequality in (2.2) holds for T = 1; that it holds for any 
real T > now follows by simple re-scaling, since (5.16) obviously implies I{y > 0} < F{Ty) for all 
real y and all T > 0. Similarly or using the reflection x >— > —x, one can see that the first inequality 
in (2.2) holds as well. 

To compute M 2 in (5.19), we need to express pG in terms p and dG. To simplify the derivation, 
assume the condition p e C , as well as the previously stated conditions (5.9) and (5.11); these 
conditions will hold in the applications anyway. Then one can see that for all real u 

pG(u) = ± p.v. f p(u - s)dG(s)— = J- f P^-s)-p(u) ^ dg _ 

the latter equality here holds because the function G was assumed odd, and hence the function 
dG is even; the latter integral in (5.20) may be understood in the Lebesgue sense, in view of the 
conditions p e C 1 and (5.11). It follows from (5.20), (5.9), and (5.11) that 

pG{t) = if |i|>l + 7. (5.21) 

To verify the first equality in (5.20), one can write 

p{x)G{x) = (i- J" e -**j>(t)dt) (^P-v.£° e- isx dG(s) 



l C ioT'^^i/ \ dS 

-00 * 

ir. j r J - . . ds 

e — dn p . v . p (u _ s)dG(s) _ 

^ Jr 27T J_ o0 s 



the second equality here is justified because of the second equality in (5.20) and the inequality 
j ^ fctlW dG( s ) ds\ < 2 7 I{|n| sc l + 7 }max| t | <1 \p'{t)\ for all real u. 

Note that the first integral in (5.20) is a convolution. One can also integrate by parts to represent 
pG as a convolution-smoothing of the derivative p'\ 

pG=£p'*dG, (5.22) 
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where 



~ f* - — - ds f - — - ds / — 1*1 ^ — - ds 

dG(t) := p.v. dG(s) — := lim dG(s) — = dG(s) — I{|t| < 7} (5.23) 

J-00 s £ i° J(-ao,t)\(-e,e) s J-7 s 

for all real t. This follows because for all real u 



f ^ U ^ ^ dG(s) ds = f dG(s) — ( f V(u)du I{s < 0} - P p» dt> I{s > 0} 

JR s JR s \Ju Ju-s 

f 00 rU-V J ru rco 1 

= p'(v)dv\ dG(s)—-\ p'(v)dv\ dG(s) — 

Ju J — co $ J— 00 Ju—v * 

= p(v ) dv dG{u — v), 

JR 

since the function dG is even. The latter condition or (5.23) also shows that the function dG is 
even. Moreover, since dG(s) — > 1 as s — > 0, (5.23) yields 

dG(t)~ln|t| (5.24) 

as t — > 0; thus, the function dG is mildly singular in a neighborhood of 0. For instance, dG(i) 
: 1 - \t\ +ln|t|)(I{|t| < 1} if dG(i) = (1- |t|)(I{|t| < 1}. 

Note also that in the case (prevented by the condition (5.11)) when dG = 1 on R, the function 
—2ipG would be the Hilbert transform of the function p; see e.g. [42, Chapter V]. 

It follows from (5.19) and (5.21) that 

M 2 (t) = if |t|>l + 7. (5.25) 

This condition on M2 is obviously weaker than the condition 

M 2 (t) = if \t\ > 1, (5.26) 

following from (2.1) and (2.5). However, by (5.19) and (5.9), one still has M\{t) = if |t| > 1, 

whereas the condition (5.26) was used in the quick proof beginning on page 10 only to bound two 

terms, ©2o(/3i) and ©2a(/32)- Therefore, one may expect the adverse impact of the weakening of 

the condition (5.26) to (5.25) to be rather limited and likely more than compensated for by the 

advantages provided by the more flexible construction of the smoothing filter M, with the tempered 

tilting of M\. Moreover, the latter construction is, essentially, more general, Indeed, for instance, 

one may always include G into the scale family (G Q ) a> o := (G(^)) >q , and then the tempered 

tilting (5.12) will be close to the harsh tilting (5.10) for large a > provided that G'(0) ^ 0. 
G' ( 0) 

Indeed, G a (x) ~ g ' x for each real x as a — > 00; of course, at that the value of k = n a in 
(5.12) with G = G Q will be quite different from that in (5.10); in fact, the value of in (5.13) with 
G = G a will then be asymptotically equivalent to the value of k* in (5.1) times g /|U , provided that 
J R |x|p(x) dx < 00. At that, the value of 7 in (5.11) for G will be replaced by the corresponding 
value 7 a := ^ for G a , so that 7 Q — > as a — > 00. 
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